

RAW SEQUENCE LISTING DATE: 09/26/2002 

PATENT APPLICATION: US/09/282,879 TIME: 10:50:25 

Input Set : N:\Crf3\RULE60\09282879.raw 
Output Set: N:\CRF4\09252002\I282879.raw 

SEQUENCE LISTING 

3 (1) GENERAL INFORMATION: 

5 (i) APPLICANT: Chatterjee, Subroto 

7 (ii) TITLE OF INVENTION: RECOMBINANT N-SMASES AND NUCLEIC ACIDS 

8 ENCODING SAME 
10 (iii) NUMBER OF SEQUENCES: 7 

12 (iv) CORRESPONDENCE ADDRESS: 

13 (A) ADDRESSEE: Dike, Bronstein, Roberts & Cushman, LLP 

14 (B) .STREET: 130 Water Street 

15 (C) CITY: Boston 

16 (D) STATE: MA 

17 (E) COUNTRY: USA / 

18 (F) ZIP: 02109 

20 (V) COMPUTER READABLE FORM: 

21 (A) MEDIUM TYPE: Diskette 

22 (B) COMPUTER: IBM Compatible 

23 (C) OPERATING SYSTEM: DOS 

24 (D) SOFTWARE: FastSEQ Version 1.5 
26 (vi) CURRENT APPLICATION DATA: 

C--> 27 (A) APPLICATION NUMBER: US/09/282,879 

C--> 28 (B) FILING DATE: 31-Mar-1999 

29 (C) CLASSIFICATION: 

31 (vii) PRIOR APPLICATION DATA: 

32 (A) APPLICATION NUMBER: US/08/774,104 

33 (B) FILING DATE: 

35 (viii) ATTORNEY/AGENT INFORMATION: 

36 (A) NAME: Corless, Peter F 

37 (B) REGISTRATION NUMBER: 33,860 

3 8 (C) REFERENCE/DOCKET NUMBER: 4 6906 

40 (ix) TELECOMMUNICATION INFORMATION: 

41 (A) TELEPHONE: 617-523-3400 

42 (B) TELEFAX: 617-523-6440 

4 3 (C) TELEX: 
4 6 (2) INFORMATION FOR SEQ ID NO : 1: 
4 8 (i) SEQUENCE CHARACTERISTICS: 

49 (A) LENGTH: 1197 base pairs 

50 (B) TYPE: nucleic acid 

51 (C) STRANDEDNESS: single 

52 (D) TOPOLOGY: linear 

54 (ii) MOLECULE TYPE: cDNA 

55 (iii) HYPOTHETICAL: NO 

56 (iv) ANTI- SENSE: NO 
W--> 57 (V) FRAGMENT TYPE: 
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58 (vi) ORIGINAL SOURCE: 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

62 ATGATGACAT ATCACGAAAC GCGCGCGTTG GCTCAAAGCG ACTTACAGCA ACTCTATGCG 60 

63 GCACTTGAAA CAACTGAATT TGGCGCTTAC TTTGCGACAC CCGCTGATGA TACTTTACGT 120 

64 TTTGGCATTG GCGCAATCGC TACGGCAAAA ACGGCTCAGG CATTACAAGG TGCGGTTGTT 180 

65 TTTGGTGCGC AGTCATTTGA TGAACAAGAG TACCCGCAGT CTGAATTGAT GGCGGGTTTT 24 0 

66 TGGTTTGTCC CCGAAGTGAT GGTGACCATC GCGGCAGATA AAATCACGTT CGGATCAGAT 300 

67 ACCGTATCTG ATTTTACGAC GTGGCTGGCG CAGTTCGTGC CAAAACAGCC AAATACGGTG 360 

68 ACCACTAGTC ATGTGACAGA TGAAGTGGAT TGGATCGAAC GGACAGAGAA TTTGATTGAT 4 20 

69 ACCTTAGCCA TCGATCAAAC CTTAGCCAAA GTCGTTTTTG GTCGGCAACA GACCCTGCAG 4 80 

70 TTATCCGACA CGTTACGACT GGCACAAATT ATTCGTGCGT TAGCTGAGCA GGCGAATACG 54 0 

71 TATCATGTGG TTTTAAAGCG ACATGATGAA TTGTTTATTT CAGCAACACC GGAACGGTTA 600 

72 GTGGCTATGT CAGGTGGTCA GATCGCTACG GCGGCGGTCG CTGGGACAAG CCGGCGCGGG 660 

73 ACGGATGGCG CTGACGATAT CGCGTTAGGC GAAGCGTTGT TAGCCAGTCA GAAAAACCGC 72 0 

74 ATTGAACATC AATATGTCGT GGCAAGTATC ACGACACGCT TGCAAGACGT GACGACGTCG 780 

75 CTAAAGGTGC CGGCCATGCC AAGTTTACTC AAAAATAAGC AAGTTCAGCA TTTGTACACA 840 

76 CCAATTACAG GGGACATTGC GGCACATTTA AGTGTGACCG CGATTGTTGA CCGCTTGCAT 900 

77 CCAACACCAG CACTGGGTGG CGTCCCACGT GAAGCGGCCC TGTATTACAT TGCGACCCAT 960 

78 GAGAAGACAC CTCGTGGCTT GTTTGCAGGT CCTATTGGCT ATTTTACCGC AGATAATAGT 1020 

79 GGGGAATTTG TGGTTGGCAT CCGTTCCATG TATGTGAATC AAACGCAGCG ACGAGCAACT 1080 

80 TTATTTGCTG GTGCCGGGAT TGTGGCTGAC TCCGATGCGC AACAAGAATA TGAAGAAACT 1140 

81 GGGTTGAAAT TTGAACCCAT GCGGCAATTG TTAAAGGACT ACAATCATGT CGAATGA 1197 
84 (2) INFORMATION FOR SEQ ID NO : 2: 

86 (i) SEQUENCE CHARACTERISTICS: 

87 (A) LENGTH: 397 amino acids 

88 (B) TYPE: amino acid 

89 (C) STRANDEDNESS : single 

90 (D) TOPOLOGY: linear 

92 (ii) MOLECULE TYPE: protein 

93 (iii) HYPOTHETICAL: NO 

94 (iv) ANTI-SENSE: NO 

95 (v) FRAGMENT TYPE: N- terminal 

96 (vi) ORIGINAL SOURCE: 

98 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 

100 Met Met Thr Tyr His Glu Thr Arg Ala Leu Ala Gin Ser Asp Leu Gin 

101 15 10 15 

102 Gin Leu Tyr Ala Ala Leu Glu Thr Thr Glu Phe Gly Ala Tyr Phe Ala 

103 20 25 30 

104 Thr Pro Ala Asp Asp Thr Leu Arg Phe Gly lie Gly Ala lie Ala Thr 

105 35 40 45 

106 Ala Lys Thr Ala Gin Ala Leu Gin Gly Ala Val Phe Gly Ala Gin Ser 

107 50 55 60 

108 Phe Asp Glu Gin Glu Tyr Pro Gin Ser Glu Leu Met Ala Gly Phe Trp 

109 65 70 75 80 

110 Phe Val Pro Glu Val Met Val Thr lie Ala Ala Asp Lys lie Thr Phe 

111 85 90 95 

112 Gly Ser Asp Thr Val Ser Asp Phe Thr Thr Trp Leu Ala Gin Phe Val 

113 100 105 110 

114 Pro Lys Gin Pro Asn Thr Val Thr Thr Ser His Val Thr Asp Glu Val 
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172 (2) INFORMATION FOR SEQ ID NO: 4: 

174 (i) SEQUENCE CHARACTERISTICS: 

175 (A) LENGTH: 7 amino acids 

176 (B) TYPE: amino acid 

177 (C) STRANDEDNESS: single 

178 (D) TOPOLOGY: linear 

180 (ii) MOLECULE TYPE: protein 

181 (iii) HYPOTHETICAL: NO 

182 (iv) ANTI-SENSE: NO 

183 (V) FRAGMENT TYPE: N-terminal 

184 (vi) ORIGINAL SOURCE: 

186 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

188 Arg Ser He Thr Val Arg Val 

189 1 5 

192 (2) INFORMATION FOR SEQ ID NO : 5: 

194 (i) SEQUENCE CHARACTERISTICS: 

195 (A) LENGTH: 18 base pairs 

196 (B) TYPE: nucleic acid 

197 (C) STRANDEDNESS: single 

198 (D) TOPOLOGY: linear 

200 (ii) MOLECULE TYPE: cDNA 

201 (iii) HYPOTHETICAL: NO 

202 (iv) ANTI-SENSE: NO 
W--> 203 (V) FRAGMENT TYPE: 

204 (vi) ORIGINAL SOURCE: 

206 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

208 TTGCGGCACT ATTAGGTG 18 

211 (2) INFORMATION FOR SEQ ID NO : 6: 

213 (i) SEQUENCE CHARACTERISTICS: 

214 (A) LENGTH: 18 base pairs 

215 (B) TYPE: nucleic acid 

216 (C) STRANDEDNESS: single 

217 (D) TOPOLOGY: linear 

219 (ii) MOLECULE TYPE: cDNA 

220 (iii) HYPOTHETICAL: NO 

221 (iv) ANTI-SENSE: NO 
W--> 222 (V) FRAGMENT TYPE: 

223 (vi) ORIGINAL SOURCE: 

225 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

227 CGCCAATGCC AAAACGTA 18 

230 (2) INFORMATION FOR SEQ ID NO : 7: 

232 (i) SEQUENCE CHARACTERISTICS: 

233 (A) LENGTH: 50 base pairs 

234 (B) TYPE: nucleic acid 

235 (C) STRANDEDNESS: single 

236 (D) TOPOLOGY: linear 

238 (ii) MOLECULE TYPE: cDNA 

239 (iii) HYPOTHETICAL: NO 

240 (iv) ANTI-SENSE: NO 



file://C:\CRF4\Outhold\VsrI282879.htm 



9/26/02 



Page 5 of 7 



RAW SEQUENCE LISTING DATE: 09/26/2002 

PATENT APPLICATION: US/09/282,879 TIME: 10:50:25 
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W--> 241 (V) FRAGMENT TYPE: 

24 2 (vi) ORIGINAL SOURCE: 

244 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

246 GATCCATGAT GACATATCAC GAAACGCGCG TTTCGTGATA TGTCATCATG 50 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/282,879 



DATE: 09/26/2002 
TIME: 10:50:26 



Input Set : N:\Crf3\RULE60\09282879.raw 
Output Set: N:\CRF4\09252002\l282879.raw 



L:27 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER:] 

L:28 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:] 

L:57 M:246 W: Invalid value of Alpha Sequence Header Field, [FRAGMENT TYPE:], SeqNo=l 

L:203 M:246 W: Invalid value of Alpha Sequence Header Field, [FRAGMENT TYPE:], SeqNo=5 

L:222 M:246 W: Invalid value of Alpha Sequence Header Field, [FRAGMENT TYPE:], SeqNo=6 

L:241 M:246 W: Invalid value of Alpha Sequence Header Field, [FRAGMENT TYPE:], SeqNo=7 
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